AD  P 001574 


ANALYZING  n SAMPLES 
OF  2 OBSERVATIONS  EACH 


J.  R.  Knaub,  Jr.,  L.  M.  Grile,  G.  Petet 
U.  S.  Army  Logistics  Center,  Ft.  Lee.VA 


■? 

i 


ANALYZING  n SAMPLES  OF  2 OBSERVATIONS  EACH 


I.  INTRODUCTION  (Modified  Westenberg  Tests  and  Length  of  Initial  Run  (LIR) 

Test)  (Knaub) 

II.  Mathematical  Theory  of  LIR  Test  (Knaub). 

j 

III.  Example  (Knaub). 

Appendix  I.  FORTRAN  Code  for  Modified  Westenberg  Tests  (Designed  for 
n pairs  of  observations)  (Knaub) 

Appendix  II.  Power  Tables  for  LIR  Test  (Knaub) 

a 

Appendix  III.  Discussion  of  Automated  LIR  Test  (ALIRT)  (Petet,  Grile) 
Appendix  IV.  Acknowledgements 
Appendix  V.  References 


23 


i MjBiaiiatMBliiMifliBa 


I.  Introduction. 

' When  data  are  scarce,  It  Is  common  to  combine  small  samples  from  a 
number  of  sources  considered  to  be  reasonably  similar.  When  sample  sizes  are 
extremely  small,  testing  the  assumption  of  similarity  of  sources  Is  often  only 
attempted  by  subjective  means.  This  paper  provides  a method  to  add  quantita- 
tive risk  assessments  to  the  study  of  this  assumption,  using  two  observations 
per  sample. 

In  addition  to  general  compatibility  testing  of  the  sources  using 
modified  versions  of  Westenberg* s Interquartile  Range  Test,  and  the  Westenberg-Mood 
Median  Test,  a new  hypothesis  test  has  been  developed  to  aid  In  Identi- 
fying whether  one  (or  more)  of  the  sources  of  data  provides  a substantially 
larger  or  smaller  set  of  values  due  to  Its  underlying  population. 

Because  the  probabilistic  risk  assessments  provided  here  address  a 
situation  so  commonly  found  In  analyzing  military  operations  as  well  as  In  test 
and  evaluation,  details  are  provided  to  simplify  the  Implementation  of  this 
methodology.  A major  goal  here  has  been  that  power  analyses  be  described  In 
terms  meaningful  to  the  user  and  the  decision  maker.  The  new  hypothesis  test 
makes  use  of  simulation-aided  power  analyses.  v 

The  tests  for  general  comparability,  using  modified  Westenberg  tests, 
were  first  Introduced  In  reference  2.  The  FORTRAN  code  for  these  tests  Is 
given  here  In  Appendix  I.  This  program  Involves  straight  forward  binomial 
probabilities.  The  first  format  statement  explains  the  variables  and  the  null 
and  alternative  hypotheses.  It  Is  written  In  terms  of  the  Interquartile  range 
of  the  combined  sample,  but  could  easily  be  written  In  terms  of  being  above  or 
below  the  median  of  the  combined  sample.  The  null  hypothesis  (Hq)  and  the 

alternative  (H^)  are  repeated  here.  "Ho:  Each  sample  has  at  least  100  x RA 

percent  chance  of  having  one  observation  Inside  and  one  observation  outside  the 
Interquartile  range.  H^:  Each  sample  has  at  least  100  x RB  percent  chance  of 

having  both  observations  fall  together  either  Inside  or  outside."  The  null 
hypotheses  for  the  Westenberg  tests  thus  Indicate  a general  compatibility  among 
the  data  sources.  If  a set  of  data  made  up  of  pairs  of  observations  from  a 
number  of  sources  appear  to  be  reasonably  homogeneous  as  judged  by  these  tests, 
however,  It  may  still  be  that  one,  or  perhaps  a few  pairs  of  observations  may 
have  been  drawn  from  a source  very  different  from  the  underlying  population  for 
the  majority  of  the  pairs  of  observations.  Therefore,  a new  test  Is  needed  due 
to  the  Inadequacy  of  the  modified  Westenberg  tests  to  discern  such  a situation. 

In  order  to  determine  whether  a pair  of  observations  may  have  values 
appreciably  larger,  or  smaller  than  the  other  observations,  the  probability  of 
having  both  members  of  that  pair  of  observations  be  among  the  largest,  or 
smallest  In  the  combined  sample  should  be  investigated.  This  Is  accomplished 
here  in  the  Length  of  Initial  Run  (LIR)  Test.  In  this  test,  the  larger  value 
In  each  pair  of  observations  Is  labeled  "A,"  while  the  smaller  of  the  observa- 
tions from  that  source  Is  labeled  "B."  If  no  pair  of  observations  Is  drawn 
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from  an  underlying  population  considerably  different  from  the  others  (in 
pari  cular  If  no  pair  of  observations  Is  drawn  from  an  underlying  population 
whost:  location  is  considerably  larger,  or  smaller  than  the  others)  then  the 
length  of  the  run  of  A's  In  the  combined  sample  before  the  first  B and  the 
length  of  the  run  of  B's  before  the  first  A should  not  be  too  short.  (Note 
that  run  lengths  can  vary  from  one  to  n.) 

II.  Mathematical  Theory  of  LI R Test. 

Under  the  null  hypothesis,  that  all  pairs  of  observations  were  drawn 
from  the  same  or  identical  populations,  the  length  of  the  Initial  run  of  A's  Is 
identically  distributed  as  the  length  of  the  initial  run  of  B's.  This  discus- 
sion will  only  be  couched  in  terms  of  the  Initial  run  of  A's. 

The  probability  of  having  an  "A"  as  the  largest  value  in  the  combined 
sample  is  unity.  The  probability  of  having  an  "A"  In  the  second  largest 
position,  under  H , Is  the  number  of  observations  not  Included  In  the  sample 

pair  that  the  first  A came  from  (2n-2),  divided  by  the  total  remaining  number 
of  observations  available  (2n-l).  The  probability  of  having  a third  A In  a row 
Is  the  previous  probability  multiplied  by  the  number  of  observations  not 
included  in  either  sample  pair  that  the  first  two  A's  came  from  (2n-4),  divided 
by  the  total  remaining  number  of  observations  available  (2n-2).  Therefore,  the 
probability  (under  H ) of  having  at  least  r of  the  A's  before  the  first  B is: 

, 2n-2  \ , 2n-4  , 2n-6  ^ , 2n-2r+2  N 

( ) ( ) ( — — )••••(  ) 

2n- 1 2n-2  2n-3  2n-r+l 


If  exactly  r of  the  A's  preceed  the  first  B,  the  probability  of  this  occurrence 
(under  Hq)  is  the  above  expression  multiplied  by  the  probability  that  the  r+lth 

largest  value  is  a B.  This  would  be  the  number  of  B values  whose  corresponding 
A value  is  among  the  members  of  the  initial  run  (r),  divided  by  the  total 
remaining  number  of  observations  (2n-r).  Simplifying,  therefore,  the  probabil- 
ity (under  Hq)  of  having  an  initial  run  of  length  r is: 


’2n-2) ! ! (2n-r-l) ! r 


(2n-2r)!!(2n-l)! 


, where  k! ! = k( k-2)( k-4^  • . . . 

(stopping  at  2 if  k is  even,  or  1 if  k is  odd). 


This  further  simplifies  to 
( n-1) !2n~1  ( 2n-r-l) ! r 

(n-r) !2n~r  (2n-l) ! 


'n-l)!2r_1  (2n-r-l)!r 


(n-r)!  (2n-l) ! 


In  general,  if  N is  the  number  of  observations  per  sample  and  n is  the  number 
of  samples,  then  the  probability  of  an  initial  run  of  length  r is: 

(n-1) !Nr-1  (Nn-r-1) ! (N-l)r 


(n-r)!  (Nn-1) ! 


Simulations  could  be  used  for  alternative  hypotheses  and  for  Irregular  numbers 
of  observations.  This  paper,  however,  Is  concerned  with  N=2  observations  per 
sample. 


Simulations  (see  Appendix  III)  were  used  to  determine  the  relative 
frequency  distribution  of  initial  run  lengths  under  various  alternative  hypo- 
theses. Each  alternative  studied  assumed  one  pair  of  observations  to  be  taken 
from  one  underlying  population  and  all  others  taken  from  a second  underlying 
population,  with  a few  exceptions  for  sensitivity  study  purposes.  The  accuracy 
of  the  simulations  was  examined  in  several  manners.  First,  both  underlying 
populations  were  set  Identical  and  the  results  compared  to  the  frequency 
distribution  for  the  null  hypothesis.  Agreement  here  demonstrated  that  the 
closed  form  solution  for  the  null  distribution  is  correct  and  also  that  the 
simulation  was  accurate  to  approximately  three  significant  digits  using  20,000 
replications  for  the  cases  of  interest  shown  in  Appendix  II,  also  under  Hq. 

However,  under  any  alternative  hypothesis,  accuracy  of  the  simulation  is 
degraded  due  to  the  fact  that  the  distributional  forms  which  the  pairs  of 
observations  are  being  drawn  from  are  not  exactly  what  they  have  been  repre- 
sented to  be.  Table  I,  however,  provides  a set  of  chi-square  "poorness"  of  fit 
tests  which  show  that,  In  the  case  investigated  there,  the  distribution  Is 
almost  exactly  as  was  represented.  (Similar  results  were  obtained  using  other 
distributions.)  Table'll  is  used  to  demonstrate  the  small  differences  in 
resulting  output  when  inputs  are  varied  to  degrees  that  were  found  unlikely  to 
actually  occur.  (Note  that  the  differences  found  in  Table  II  were  of  only 
approximately  the  same  magnitude  as  in  Table  I.)  From  this,  It  is  generally 
concluded  that  only  two  significant  digits  should  be  used  from  the  relative  and 
cumulative  relative  frequency  outputs. 

In  addition  to  the  type  of  validation  shown  above,  the  simulation 
results  were  compared  to  a closed  form  solution  for  the  probability  of  a run  of 
length  one  when  one  pair  of  observations  is  drawn  from  one  distribution  and  all 
others  from  a second  distribution.  In  order  for  there  to  be  a run  of  length 
one,  both  observations  from  the  same  pair  must  be  the  two  largest  (or  smallest) 
observations  in  the  combined  sample.  Therefore,  if  p is  the  probability  of  a 
run  of  length  one,  and  we  are  investigating  the  initial  run  of  A's,  and  only 
one  pair  of  observations  is  drawn  from  one  distribution  with  all  others  drawn 
from  a second  distribution, 
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TABLE  la 


Random  Nos. 

True  Generated  From  A 


N(12,l) 

VS 

N(12,l) 

# 

#0BS 

CELL# 

#0BS 

CELL# 

#OBS 

CELL# 

#OBS 

1 

370 

26 

410 

51 

417 

76 

405 

2 

375 

27 

428 

52 

402 

77 

403 

3 

402 

28 

382 

53 

398 

78 

405 

4 

369 

29 

413 

54 

381 

79 

360 

5 

412 

30 

386 

55 

447 

80 

361 

6 

376 

31 

400 

56 

385 

81 

393 

7 

407 

32 

396 

57 

393 

82 

388 

8 

424 

33 

426 

58 

375 

83 

408 

9 

413 

34 

386 

59 

404 

84 

381 

10 

381 

35 

433 

60 

401 

85 

388 

11 

377 

36 

380 

61 

419 

86 

404 

12 

380 

37 

407 

62 

371 

87 

388 

13 

394 

38 

418 

63 

414 

88 

408 

14 

381 

39 

393 

64 

400 

89 

421 

15 

382 

40 

407 

65 

415 

90 

411 

16 

364 

41 

370 

66 

396 

91 

422 

17 

388 

42 

428 

67 

415 

92 

397 

18 

398 

43 

411 

68 

429 

93 

395 

19 

423 

44 

395 

69 

389 

94 

398 

20 

427 

45 

405 

70 

424 

95 

394 

21 

381 

46 

347 

71 

417 

96 

414 

22 

398 

47 

413 

72 

406 

97 

429 

23 

393 

48 

445 

73 

439 

98 

392 

24 

394 

49 

415 

74 

404 

99 

420 

25 

370 

50 

392 

75 

392 

100 

417 

THE  CHI-SQUARE  VALUE  FOR  A N(12,l) 
USING  40000  GENERATED  RANDOM  NUMBERS 
TESTED  AGAINST  A N(12,l) 

IS  94.56 


TABLE  lb 


Random  Nos. 

True  Generated  From  A 

N(12,l)  VS  N( 12, 0.95) 


CELL  # 

#OBS 

CELL# 

#0BS 

CELL# 

#0BS 

CELL# 

#0BS 

1 

282 

26 

385 

51 

449 

76 

437 

2 

295 

27 

426 

52 

415 

77 

390 

3 

317 

28 

430 

53 

418 

78 

349 

4 

344 

29 

403 

54 

402 

79 

387 

5 

343 

30 

424 

55 

468 

80 

410 

6 

372 

31 

409 

56 

395 

81 

394 

7 

361 

32 

424 

57 

426 

82 

399 

8 

380 

33 

424 

58 

385 

83 

393 

9 

390 

34 

409 

59 

435 

84 

392 

10 

402 

35 

433 

60 

417 

85 

389 

11 

378 

36 

417 

61 

444 

86 

401 

12 

362 

37 

412 

62 

387 

87 

409 

13 

370 

38 

445 

63 

413 

88 

406 

14 

396 

39 

424 

64 

454 

89 

394 

15 

382 

40 

403 

65 

399 

90 

414 

16 

383 

41 

426 

66 

439 

91 

375 

17 

362 

42 

401 

67 

435 

92 

368 

18 

369 

43 

458 

68 

416 

93 

382 

19 

412 

44 

399 

69 

432 

94 

366 

20 

436 

45 

432 

70 

441 

95 

383 

21 

438 

46 

387 

71 

424 

96 

366 

22 

385 

47 

418 

72 

451 

97 

367 

23 

388 

48 

460 

73 

418 

98 

335 

24 

416 

49 

442 

74 

398 

99 

354 

25 

396 

50 

415 

75 

414 

100 

300 

THE  CHI-SQUARE  VALUE  FOR  A N( 12,0.95) 
USING  40000  GENERATED  RANDOM  NUMBERS 
TESTED  AGAINST  A N( 12 , 1 ) 

IS  302.45 


TABLE  Ic 


True 


Random  Nos. 
Generated  From 


CELL 


N(12,l) 

VS 

N( 12 , 1 . 05) 

# 

#0BS 

CELL# 

#OB5 

CELL# 

#0BS 

CELL# 

#0BS 

1 

495 

26 

418 

51 

402 

76 

393 

2 

443 

27 

366 

52 

371 

77 

384 

3 

448 

28 

407 

53 

388 

78 

398 

4 

445 

29 

378 

54 

372 

79 

398 

5 

412 

30 

386 

55 

398 

80 

359 

6 

442 

31 

367 

56 

396 

81 

342 

7 

444 

32 

427 

57 

366 

82 

385 

8 

428 

33 

356 

58 

372 

83 

401 

9 

399 

34 

423 

59 

370 

84 

392 

10 

395 

35 

352 

60 

383 

85 

380 

11 

396 

36 

386 

61 

387 

86 

398 

12 

399 

37 

423 

62 

400 

87 

396 

13 

378 

38 

385 

63 

358 

88 

402 

14 

384 

39 

362 

64 

368 

89 

422 

15 

354 

40 

385 

65 

422 

90 

417 

16 

404 

41 

365 

66 

385 

91 

433 

17 

411 

42 

403 

67 

376 

92 

433 

18 

411 

43 

394 

68 

421 

93 

418 

19 

413 

44 

387 

69 

381 

94 

416 

20 

383 

45 

360 

70 

382 

95 

427 

21 

373 

46 

345 

71 

404 

96 

424 

22 

392 

47 

392 

72 

411 

97 

473 

23 

392 

48 

427 

73 

392 

98 

479 

24 

368 

49 

391 

74 

431 

99 

472 

25 

391 

50 

380 

75 

391 

100 

566 

THE  CHI-SQUARE  VALUE  FOR  A N( 12,1.05) 
USING  40000  GENERATED  RANDOM  NUMBERS 
TESTED  AGAINST  A N(12,l) 

IS  277.28 
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TABLE  I la 


Input  distributions  are  N(12.00,  1.00)  and  N(10.00,  1.00).  Number  of  samples 
from  each  distribution  Is  1 and  9 respectively.  The  random  number  seed  for 
this  run  Is  65557. 


Test  for  the  Length  of  the  Initial  Run  of  A's  before  the  first  B. 
Number  of  replicationr.20000 


Length 

Observed 

Relative 

Cumulative 

of  Run: 

Frequency: 

Frequency: 

Frequency: 

1 

7275 

0.363750 

0.363750 

2 

4391 

0.219550 

0.583300 

3 

2980 

0.149000 

0.732300 

4 

2123 

0.106150 

0.838450 

5 

1534 

0.076700 

0 915150 

6 

857 

0.042850 

0.958000 

7 

517 

0.025850 

0.983850 

8 

246 

0.012300 

0.996130 

9 

67 

0.003350 

0.999500 

10 

10 

0.000500 

1.000000 
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TABLE  lib 


W 


Input  distributions  are  N(12.00,  0.95)  and  N(10.00,  1.00).  Number  of  samples 
from  each  distribution  Is  1 and  9 respectively.  The  random  number  seed  for 
thl s run  Is  65557. 

Test  for  the  Length  of  the  Initial  Run  of  A's  before  the  first  B. 

Number  of  replications: 20000 


Length 

Observed 

Relative 

Cumulative 

of  Run: 

Frequency: 

Frequency: 

Frequency: 

1 

7406 

0.370300 

0.370300 

2 

4480 

0.224000 

0.594300 

3 

3018 

0.150900 

0.745200 

4 

2134 

0.106700 

0.851900 

5 

1419 

0.070950 

0.922850 

6 

806 

0.040300 

0.963150 

7 

469 

0.023450 

0.986600 

8 

203 

0.010150 

0.996750 

9 

55 

0.002750 

0.999500 

10 

10 

0.000500 

1.000000 
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TABLE  lie 


Input  distributions 
from  each  dlstrlbutl 
this  run  Is  65557. 


are  N(12.00,  1.05)  and  N(10.00,  1.00) 
on  Is  1 and  9 respectively.  The  random 


Number  of  samples 
number  seed  for 


Test  for  the  Length  of  the  Initial  Run  of  A's  before  the  ftrst  B. 
Number  of repiicationa: 20000 


Length 
of  Run: 
1 
2 

3 

4 

5 

6 

7 

8 
9 

10 


Observed 

Frequency: 

7141 

4273 

2969 

2119 

1611 

958 

556 

284 

78 

11 


Relative 

Frequency: 

0.357050 

0.213650 

0.148450 

0.105950 

0.080550 

0.047900 

0.027800 

0.014200 

0.003900 

0.000550 


Cumulative 

Frequency: 

0.357050 

0.570700 

0.719150 

0.825100 

0.905650 

0.953550 

0.981350 

0.995550 

0.999450 

1.000000 
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where  ^ ^(x)  the  distribution  of  the  k — order  statistic  out  of  2 

observations  in  distribution  1 (distribution  1 is,  in  general,  the  distribution 
of  larger  location  if  the  initial  run  of  A's  is  being  Investigated);  and  gg  m ^ 

is  the  distribution  of  the  k — order  statistic  out  of  m for  distribution  2. 
Since  there  are  n-1  pairs  of  observations  taken  from  distribution  2,  m = 2(n-l) 
= 2n-2,  and  therefore: 


m-1  2n-2-l  2n-3 

The  expression  for  p is  based  on  the  fact  that  if  both  observations  of 
a given  pair  have  larger  values  than  any  other  observation  in  the  combined 
sample,  then  the  B value  associated  with  the  largest  A has  to  be  larger  than 
all  other  2n-2  values. 

Once  again,  use  of  this  validation  technique  supported  the  conclusion 
that  two  significant  digits  should  be  used  in  the  results. 

If  fj  (x)  is  the  density  function  for  distribution  1,  and  fg  (x)  for 

distribution  2,  an  approximation  can  be  made  for  p when  the  number  of  observa- 
tions drawn  from  each  distribution  are  equal,  or  nearly  equal,  and  very  small. 
This  approximation  will  be  very  poor  unless  the  assumptions  are  enforced*  In 
general,  however,  the  calculations  are  much  easier  than  those  in  the  previous 
expression.  In  this  case  the  approximation  is  as  follows: 


*co  »oo 

= J f2(x)  j f]  (t)dtdx;  4>  * <#>/ (1  -«j 


x = -» 


t = x 


/ \ / $ \ , / 2o-v  \ / 1 \ 

" 'v$+2n-v' ' (v-1  )$+2n-v'  'v$+2n-v' ' v*+2n-v-1 ' 

where  v is  the  number  of  observations  taken  from  tlv»  first  distribution. 

<j>  is  the  probability  that  if  one  observation  were  drawn  from  each  of  the 
two  distributions,  the  observation  from  distribution  1 would  have  a larger 
value.  The  approximation  is  therefore  a weighted  counting  procedure  which  doe* 
not  fully  account  for  the  shapes  of  the  true  distributions  of  interest. 

The  tables  of  Appendix  II  provide  power  information  for  a variety  of 
cases  when  the  number  of  samples  (of  size  two  each)  is  5,  10,  20,  and  50.  The 
alternative  hypotheses  could  have  more  than  two  underlying  distributions  (up  to 
n)  but  two  are  sufficient  to  illustrate  what  is  being  investigated  here.  The 
importance  of  this  test  is  to  determine  the  likelihood  of  having  one  (or 
possibly  more  than  one)  pair  of  observations  drawn  from  an  underlying  popula- 
tion which  is  substantially  different  from  the  underlying  population  from  which 
the  rest  of  the  observations  were  drawn. 
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If  sample  sizes  are  large,  It  is  possible  to  reject  Hq  when  the  truth 
Is  close  enough  to  Hq  for  practical  purposes,  unless  a sDecIfic  is  used  for 

a power  analysis.  When  sample  sizes  are  small,  as  in  the  case  here,  one  may 

fall  to  reject  H when  the  truth  is  not  close  to  H . Power  analyses  would  help 
o o 

by  completing  the  quantification  of  the  problem.  Without  a power  analysis,  a 
hypothesis  test  Is  only  half  completed.  Null  and  alternative  hypotheses  work 
In  pairs  analogously  to  confidence  limits.  In  the  present  situation,  a power 
analysis  Is  very  important  due  to  the  unconventional  nature  of  the  problem. 

(It  is  Interesting  to  note  that  when  the  number  of  samples  [of  size  two  each] 
Increases,  the  power  level  at  a given  significance  level  remains  apparently 
approximately  constant  [see  Appendiix  II].) 

This  paper  makes  use  of  a much  neglected  application  for  simulation. 
Simulation  can  be  used  as  a check  for  a closed  form  solution  when  the  develop- 
ment of  such  a solution  was  subtle,  in  addition  to  handling  situations  where  a 
closed  form  solution  is  difficult  if  not  impossible. 

III.  Example. 


Suppose  that  ten  processes  (or  items  of  equipment,  etc.)  are  to  be 
examined  for  a certain  trait  and  that  it  is  expected  that  they  will  all  behave 
similarly  for  that  trait.  Also,  suppose  that  the  expense  involved  in  studying 
those  processes  is  great,  or  that  for  some  other  practical  reason,  the  number 
of  observations  per  process  (item,  scenario,  etc.)  must  be  kept  extremely 
small.  If  two  observations  each  are  used,  a combined  sample  size  of  20  is 
obtained.  Whether  or  not  these  samples  should  be  combined  would  then  be  open 
to  examination.  In  addition  to  any  subjective  arguments,  the  modified  Westen- 
berg  tests  and  the  LI R test  should  be  applied  to  assist  in  this  examination. 
Suppose  (for  use  in  the  LIR  test)  that  the  larger  value  in  each  pair  of  obser- 
vations is  labelled  "A,"  and  the  smaller  values  labelled  "B."  Subscripts 
"1"-"10"  could  be  used  to  denote  which  process  is  being  represented.  (This 
will  be  used  in  the  modified  Westenberg  tests.)  Suppose  that  when  the  values 
for  each  of  these  observations  are  ranked  from  largest  to  smallest  the  follow- 
ing result  is  obtained: 


A8  B8  A7 


i 

A 1 
*1  « 
i 

times 


A4  A5 


A6  A10 


B2  A9  B7  B4 


bi! 


B6  B10  B3  B9  B5 


The  number  of  times  that  a pair  of  observations  are  both  found  on  the  same  side 
of  the  median,  NZ,  is  2.  Also,  the  number  of  times  that  a pair  of  observations 
are  both  found  either  inside  or  outside  the  interquartile  range  is  NZ=2.  Table 
III  shows  results  taken  from  the  program  of  Appendix  I.  PA  is  the  probability 


level  of  the  test  associated  with  H 


ated  with  H^.  An  examination  of 


o’ 
thi  s 


and  PB  is  the  probability  level  associ- 
table  shows  that  there  is  no  good  reason, 


based  on  this  ranked  data  and  aside  from  subjective  arguments,  to  conclude  that 
these  samples  should  not  be  combined,  but  actually  there  is  good  reason  to 
conclude  that  such  a combination  Is  advisable.  However,  the  LIR  test  can  be 
used  to  show  that  even  though  general  compatibility  appears  evident,  the 
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TABLE  III 


RUN  U C 

ho:  EACH  SAMPLE  HAS  AT  LEAST  IOOxRAV.  CHA'iCE  OF  HAVING 

ONE  OBSERVATION  INSIDE  AND  ONE  OBSERVATION  OUTSIDE 
THE  INTERQUARTILE  RANGE. 

HI!  EACH  SAMPLE  HAS  AT  LEAST  JOOxRBI'.  CHANCE  OF  HAVING 
BOTH  OBSERVATIONS  FALL  TOGETHER 
EITHER  INSIDE  OR  OUTSIDE 


INPUTS  are: 

NS,  THE  NUMBER  OF  SAMPL^„ 

NL.  - • ,JE  NUMBER  OF  ZEROES 
RA  AND  RB 

THE  NUMBER  OF  ZEROES  IS  THE  NUMBER  Or  SAMPLES  WHOSE  TWO  OBSERVATIONS 
ARE  FOUND  TOGETHER 


ENTER  NS , NZ , RA , RB 

10,2,0.95,0.50 
PA=0.2262  PB=0. 1875 

Do  you  wisn  to  run  tne  test  asainr 
Enter  "Y"  or  "N". 

Y 


ENTER  NS»NZ,RA,RB 

10,2,0.50,0.50 
PA=0.9G88  . PB=0 . 1 875 

Do  you  wish  to  run  the  test  asain? 
Enter  "Y"  or  "N". 

Y 


ENTER  NS,NZ,RA,RB  . 

10,2,0.75,0.25 
PAa0.7G27  ?3a0.G328 

Do  you  wish  to  run  tne  test  asain? 
Enter  "Y"  or  MN". 

N 

FORTRAN  STOP 


i 

s 
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GRAPH 


Initial  run  of  A's  Is  extremely  short  (one)  and  this  Indicates  that  sample  8 
may  provide  values  too  large  in  comparison  to  the  other  values.  The  LIR  test 
shows  that  If  the  null  hypothesis  (that  all  pairs  of  observations  were  drawn 
from  the  same  underlying  distribution)  is  true,  the  probability  of  an  initial 
run  of  length  one  Is  0.053  when  the  number  of  samples  (n)  is  10.  This  Is  the 
probability  of  a Type  I error.  However,  if  the  null  hypothesis  Is  "accepted" 
here,  the  probability  of  a Type  II  error  (for  any  alternative)  Is  unity  since 
no  amount  of  evidence  against  HQ  would  then  suffice.  If  the  values  of  the 

observations  here  suggest  that  graph  A is  the  result  of  a reasonable  alterna- 
tive hypothesis,  then  the  power  against  that  alternative  Is  of  Interest  and  Is 
approximately  0.56.  Against  the  alternative  used  to  arrive  at  graph  B,  the 
power  Is  0.36,  and  the  power  from  graph  C would  be  0.12.  Note,  however,  that 
the  alternative  shown  here  against  which  the  power  is  lowest  Is  for  a situation 
better  Investigated  by  the  tests  of  Appendix  I,  program  WC,  originally  found  in 
reference  2.  Note  also  that  even  If  all  20  observations  here  came  from  the 
same  distribution,  that  is  a small  sample  to  use  to  determine  the  form  of  that 
distribution.  Therefore,  other  alternatives  Involving  distributional  forms 
other  than  normality  may  be  needed  to  complete  the  analysis. 

Suppose  Bg  and  A^  were  not  as  shown,  but  exchanged  in  position.  The  values 

of  NZ  would  remain  the  same,  but  the  length  of  the  initial  run  of  A's  would  now 
be  two.  If  Ho  is  rejected  when  the  observed  run  length  Is  2,  the  probability 

of  a Type  I error  is  0.158.  Against  the  first,  second,  and  third  alternatives 
mentioned  above,  the  probability  of  a Type  II  error  would  be  0.44,  0.64,  and 
C.6S,  respectively,  and  the  power  would  be  0.7f , U.5E,  ar.d  0.34,  also 
respectively. 

If  a run  of  length  3 is  considered,  these  figures  are,  respective  to  the 
order  given  above,  0.307,  0.24,  0.42,  0.66,  0.87,  0.73,  and  0.62. 

Considering  the  above,  when  n=10  It  could  be  deemed  reasonable  to 
reject  Hq  when  the  run  length  is  2 or  less,  and  "accept"  It  when  It  Is  3 or 

more. 

In  conclusion,  if  some  of  the  observations  in  the  example  given  earlier 
are  shifted  In  rank  It  may  affect  one  or  more  or  perhaps  none  of  these  tests. 
Also,  If  the  modified  Westenberg  tests  greatly  discourage  the  combining  of  the 
samples,  then  the  LIR  test  will  probably  not  be  very  useful.  In  using  the 
modified  Westenberg  tests,  If  it  is  not  possible  to  divide  the  observations 
Into  groups  of  equal  size  (above  and  below  the  median  and  inside  and  outside  of 
the  interquartile  range),  then  apply  the  tests  shifting  the  observatlon(s) 
which  are  In  question  from  one  possible  grouping  to  the  other  and  average  the 
results  obtained.  Finally,  because  the  Bernoulli  trials  In  the  modified 
Westenberg  tests  used  here  are  not  truly  independent,  these  tests  are 
approximate.  Also,  some  RA  and  RB  values  may  be  inappropriate.  For  example, 

In  Table  III,  RA  could  never  be  0.95;  however,  It  Is  used  to  Illustrate  the 
strong  conclusiveness  of  these  particular  results.  A study  of  the  null 
hypothesis  Indicates  that  for  RA  = 0.5,  the  binomial  distribution  used  for  this 


test  has  thicker  "tails"  than  warranted, 
described  by 


(The  exact  null  distribution  Is 


_,(NS!/2  NZ/2)2 
21  ,NS 


(2NS) !/2r 


NS!/(NS-NZ)i  . \ 

o * • / 


[(NZ/2)!] 


trU!  ncl  d^stribut1ofl  Is  skewed  to  the  right,  but  not  appreciably 
nf  e?K6edH  5°;  For,sma11er  numbers  of  samples,  WB  1 from  reference  2 is 
°f.  USl e\ Jh®  advantages  In  using  program  WC  are  that  understandable 
alternative  hypotheses  can  be  shown  for  decision  making,  and  a large  number  of 

nppH  h ,3airStCK?  be,ha"dled  with  very  little  computer  time.  All  that  Is  really 
needed  1S  a table  of  the  cumulative  binomial  distribution.  This  test  should  be 

testldeThP  itr1^  fnd  dirty  aS  3 Pre11n,1'nar>y  t0  the  Implementation  of  the  LIR 
test.  The  LIR  test  is  an  exact  test  and  is  easily  and  meaningfully  applied 
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APPENDIX  I 


FORTRAN  CODE  FOR  Modified  Westenberg  Test 
(Designed  for  n pairs  of  observations) 

This  program  was  referred  to  as  "WC"  In  reference  2.  Note  that  unlike  the 
other  modified  Westenberg  tests  of  reference  2,  which  are  exact,  this  one  Is  an 
approximate  tost. 
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*mmpgmlppp*T 


1100 

/ 

/ 

/ 

/ 

/ 

/ 

/ 

/ 

/ 

1000 

999 


2 


3 


5 


6 

100. 

2000 

2001 

2002 

i 

2003 

2010 


CHARACTERM  A:,S.»Ek 

j ioo) 

FORMA!  <SX, 'ilO:  EACH  SAMPLE.  HAS  AT  LEAST  lOOxRAl  CHANCE', 

' OF  H A V 1 G ' , / , H X r ' 0 N K OBSERVATION  INSIDE  AND  ONE' 

' ObStR  V A T 1JN  ilU T S 1 D E * , / , 9 X , ' THE  INTERQUARTILE  RANGE,', 
//,bX,'Hl':  EACH  SAMPLE  HAN-  AT  LEAST  lOOxRBI  CHANCE', 

' OF  HAVING', /,9X,'bOTH  OBSER  VA  I IOhS  FALL  TOGETHER', 

/,9X, 'EITHER  INSIDE  UR  OUTS  I DE ',//////, 5X INPUTS  ARE:  ', 
/,5X,'NS,  THE  NUMBER  OF  SAMPLES',/, 

SX,'NZ,  THE  NUMBER  OF  ZEROES', 

/,5X,'KA  AND  RB',//,bX, 'THE  NUHeER  OF  ZEROES  IS  THE  NUMBER  OF', 
' SAMPLES  NHQSE  TWO  OBSERVAT  IONS S X , 'ARE  FOUND  TOGETHER') 
wRITF(b,99«) 

FORMAT  (//,*>X, 'ENTER  NS , NZ , R A , RB  ' ) 

READC5,*).  S,t.Z,«A,KB 

IsNS/2 

JsfiZ/k 

PA  = 0 

PB  = 0 

K = 0 

X=  I 

Y=K 

W=0.5*I 

I F C Y , GE , w j GuTU  100 
P=X 

P=P/CX-Y) 

T=0 
T = T + 1 
P=P*(X-T) 

P=P/(X-Y-T) 

L=X-Y-T 

IF ( L , GT , 1 ) GOTO  5 

IF ( K , GT , J ) GOTO  fa  

PB=PB+P*( CHB**K)*((1-RB)**CI-K))) 

KsK+l 

IF ( K , LE , J ) GOTO  2 
K = K - 1 

PAsPA  + P* ( ( ( 1-RA)*»K)*(RA**II-K) ) ) 

K = K+ 1 _ 

IF ( K~1 ) 2 , 2 , 2000 
Y = X-Y 
GOTO  3 

*RlT£(6,2O01)PA,PB 

FORMAT ( IX, 'PA=',Eo.4,5X,  'PB  = ',Fb, 4) 

*RITE(6,2002) 

form. at (T2 , 'Do  you  wish  to  run  the  test  again?',/, 

T2, 'Enter  "Y"  or  " N " , ' ) 

READ(t>,20u3)AsSnER 
FORMAT! A ) 

IFCANSWER.EO,  'Y'JGOTO  1000 

STOP 

END 
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APPENDIX  II 


POWER  TABLES  FOR  LIR  TEST 


The  distributions  from  which  the  input  is  to  be  drawn  for  each  of  the 
alternative  hypotheses  are  shown  here  followed  by  histograms  of  the  relative 
frequency  distributions  for  these  alternatives.  In  all  cases  here,  both  of  the 
distributions  from  which  the  samples  are  drawn  are  of  the  same  type  but  with 
parameters  which  differ  In  sone  respect.  The  samples  could  have  been  drawn 
from  totally  different  distributional  forms,  and  more  than  two  such  distribu- 
tions could  have  been  used  (up  to  n);  however,  what  is  used  here  Is  sufficient 
to  demonstrate  this  test  under  conditions  which  Illustrate  its  usefulness. 

In  the  case  of  the  normal  distribution,  when  standard  deviations  are  the 
same,  the  symmetric  nature  makes  the  relative  frequency  distribution  for  the 
initial  run  of  A's  the  same  as  that  for  the  initial  run  of  B's,  if  the  number 
of  pairs  of  observations  from  each  of  the  two  input  distributions  Is  inter- 
changed. This  is  true  in  all  cases  where  symmetric  input  distributions  of 
equal  variance  are  used.  Also,  when  two  symmetric  input  distributions  with  the 
same  location,  but  unequal  variances  are  used,  this  principle  applies.  When- 
ever this  occurs,  the  output  relative  frequency  distributions  here  are  written 
in  terms  of  the  initial  run  of  A's. 

The  Church-Harri s-Downton  (C-H-D)  method  of  testing  the  probability  of 
motor  case  rupture  in  missile  testing,  see  reference  1,  makes  use  of  a statis- 
tic related  to  $ (shown  in  Section  II  here).  This  statistic  is 

(/jj  - /j ^ ) / ( + <r‘)  , where  the  subscripts  "1"  and  "2"  refer  to  the  two 

input  distributions.  For  any  of  the  tables  involving  two  normal  distributions, 
if/jp  yu-p  cTp  and  are  changed  such  that  the  above  statistic  remains  con- 

stant, then  the  output  relative  frequency  distributions  given  here  are  appli- 
cable. 

In  the  case  of  the  gamma  input  distributions,  whenever  the  g's  (scale 
parameters)  are  both  multiplied  by  the  same  factor,  the  output  distributions 
are  still  applicable.  For  triangular  input  distributions,  if  all  parameters 
are  added  to,  subtracted  from,  divided  or  multiplied  by  the  same  number,  the 
output  distributions  will  not  be  changed. 

The  tables  given  here  are  for  two  normal  distributions,  two  gamma  distri- 
butions, two  triangular  distributions  and  finally,  two  beta  distributions.  The 
parameters  were  picked,  in  many  cases,  such  that  the  power  against  the  alter- 
natives was  approximately  0,5  when  the  significance  level  was  approximately  0.1 

to  0.15.  This  occurs  for  H , as  shown  in  the  following  table: 

o 3 
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# of  samples  run  length 

of  size  2 each  under  H 

0 


(probability  level 

under  H ) 
o 


5 

10 

20 

50 


1 

2 

3 

4 

5 


-11% 

-15% 

“15% 

“10% 

“15% 


The  following  tables  provide  a variety  of  examples  of  alternative  hypothe- 
ses and  results  obtained  using  them.  It  Is  hoped  that  this  appendix  Is 
sufficient  to  provide  a working  knowledge  of  the  power  of  this  test  to  Its 
users.  When  any  specific  alternative  which  the  user  is  interested  In  investi- 
gating does  not  appear  here,  and  the  user  does  not  wish  to  spend  the  time  to 
get  the  programs  of  Appendix  III  to  run  at  a convenient  facility,  It  Is  hoped 
that  the  results  can  be  Interpolated  from  results  provided  here.  Note  that 
following  each  graph  of  the  distributions  from  which  the  observations  are 
hypothesized  (H^)  to  have  been  drawn,  only  relative  frequency  distributions  are 

provided.  The  hollow  bar  graphs  show  the  relative  frequency  distribution  under 
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1 2 3 4 S 6 7 8 9 10  11  12  13  14  15  16  1/  18  19  20 


H0:  20  pairs  of  observations  drawn  from  the  same  distribution 
H.:  19  pairs  from  N(x,l)  and  1 pair  from  N(x+2 ,1) 


Hj:  49  pairs  from  N(x,l)  and  1 pair  from  N(x*2,l' 


* •)«*<!  r» 

••  « m ;♦  »j 


2 3 4 S 6 7 8 9 10  11  12  13  14  1$  16  17  18  19  20 

H0:  20  pairs  of  observations  draun  from  the  same  distribution 
Hj:  19  pairs  from  N(x,l)  and  1 pair  from  N(x+3,1) 


49 


1 2 3 4 5 6 7 8 9 

HQ:  20  pairs  of  observations  drav 
Hj:  19  pilrs  from  N(x,2)  and  1 pi 


5 


o 


5 pairs  of  observations  drawn  from  the  same  distribution 


: 4 pairs  from  G(17,<)  and  1 pair  from  G(10,>‘  (for  UR  of  B's) 
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l 

f, 


t 


iiljix 


20  pairs  of  observations  drawn  from  the  same  distribution 
19  pairs  from  G{l0,x)  and  1 pair  from  G(17,x)  - A's 


20  pairs  of  observations  draw  from  the  same  distribution 
19  pairs  from  G(17,x)  and  1 pair  froin  G(10,x)  - B‘s 
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G( 15.x) 


G{23.x) 

♦ ♦ ♦ 


i > < i i i t i t 


• < f i i « 


• - . .tTTt 

o 


• t I 

o 

o 


2 3 4 5 6 7 8 9 10 


10  pairs  of  observations  drawn  from  the  same  distribution 
9 pairs  from  T(x,x*10,x*2)  and  1 pair  from  T(x+5,x+lS ,x*7)  - A‘s 


3 4 5 6 7 8 9 10 

H0:  10  pairs  of  observations  drawn  from  the  same  distribution 
Hj:  9 pairs  from  Ux+5,x+15,x+7)  and  1 pair  from  T(x,x*10,x+2)  - B’s 
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As  stated  In  the  Introduction  to  this  paper,  If  "A"  Is  the  larger  value  and 
"B"  Is  the  smaller  value  of  one  pair  of  observations,  then  the  probability  for 
the  Length  of  Initial  Run  (LIR)  of  "A,ns  or  the  probability  for  the  LIR  of 
"B"'s  can  be  used  to  Indicate  whether  data  has  been  drawn  from  the  same 
distribution  or  from  two  different  distributions. 

In  most  cases,  determining  the  probability  of  a LIR  equal  to  N (where 
N=l,2,3,...)  Is  Impractical  using  analytical  methods. 

The  purpose  of  this  prpgram  Is  to  use  simulation  to  estimate  the  probabil- 
ity of  observing  a LIR  equal  to  N for  an  alternative  hypothesis  that  assumes 
data  has  been  drawn  from  two  different  distributions  rather  than  the  same 
distribution  (as  the  null  hypothesis  assumes). 
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Llij'l  'JK  VARlAoLtS 


r 


Variables  useo  in  subprogram  iM  TlALiZt 


nisi;  value  ,deter  nir.ea  oy  input,  *hici.  speciiies  the  probability 
aistr ioution. 

1 k h h : input  value  tor  the  total  number  of  replications  to  ne  per- 

formed . 

ILL:  Logical  unit  assignment  oasen  on  input  value, 

n A e L : vame  of  output  file  (*hen  ivFiLtsb), 

disp:  Disposition  of  output  tile  at  termination  of  program  (#nen  , 

M-lLL=b).  ^ 

lit*!  value  iA  or  H)  desiunatiny  tne  type  of  run  test  to  oe  performed 

LhllLR:  Specifies  tne  type  of  probaoility  Distribution;  u-normai,  ! 

O-qamma,  B-neta,  or  I-tr i anqu lar . j 


Variables  useo  in  subprogram  SUnf 


SOtf:  Ihe  total  number  ot  samples  ara*n  (f-SA^t  1 ) ♦ tvSA*»FC2) ) • 

AkUk:  Length  ot  tne  initial  run  of  'A#s  before  tne  tirst  ' B ' . 

BkO<v:  Lenqth  ot  tne  initial  run  ot  'h's  before  the  first  'A'. 

AKtiULTlSm) : Array  wnlch  stores  tne  numoer  of  times  each  r A K U r« 

ler.gtn  occurs  durinu  the  entire  simulation, 

oRLShUT (Sob  ) : Array  v*nicn  stores  tne  numoer  of  times  eacn  'brim 

lenqtn  occurs  during  the  entire  simulation. 

Variables  used  in  subprogram  RLROnl 

PtKCfc NT (SLb ) : Relative  frequency  destrioution  of  the  lir 

(ARLSuLJ  (SU'O/IrLR;  I'KtSuul  (Suy  )/lRr-R)  , 

CO(SoK):  Cumulative  frequency  distribution  ot  tne  LIR. 

Ar.s*,t.t<:  Input  value  *hicn  determines  tne  status  of  oar  graon 

output  of  frequency  distribution  (print:itfi  or  i^u). 

Variables  useo  in  subprogram  nORiAL. 

nu(l);  frean  for  distribution  i. 
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SIGMA 1 1 ) ( Standard  deviation  for  distrioution  I. 

X:  An  Independent  identically  distributee  uniform 

random  number  generated  by  tne  VAX  11/780  subprogram  '/am', 

T*  C2X-1):  A test  value  to  determine  acceptaoility  of  the 

generated  random  numbers. 

y:  N*(o,n. 

XI,  X2l  K"(ML,SlGKA)#s. 

Variables  useo  in  subprogram  gamma 

ALPHA C 1 ) 1*1,2:  Shape  parameter. 

beta ( l ) 1=1,2:  Scale  parameter. 

P:  An  independent  identically  distributed  unitors 

random  numoer  generatea  oy  the  VAX  11/760  subprogram  'hak'. 

TEST  i *i  Test  values  used  to  determine  acceptability 
of  the  generated  random  numoers, 

y:  gam“ialpha(1),i)  1=1,2. 

XCl)  1*1,2:  GAM“(ALPrtA(I),btlTA(I)), 

Variables  useo  in  subprogram  bEi A 

alpha ( 1 , J ) 1,J*1,2:  Shape  parameters. 

R:  An  independent  identically  distributed  uniform 

random  number  generated  oy  the  VAX  H/760  subprogram  'HA*'. 

TEST  * a:  Test  values  used  to  determine  acceptability 

of  tne  generatea  random  numbers. 

y:  gam~(alphA(I),i)  1=1,2. 

X1CI)  1=1,2:  GAM" (ALPHA ( 1 ) , 1 ) 1=1,2. 

X2 ( 1 ) 1=1,2:  bET"( ALPHA(l) ,ALPHA(J)J  1,J*1,2. 

Variables  used  in  subprogram  thiang 
G:  Minimum  value.  Location  parameter, 

h:  Maximum  value.  Max-Mln=Scale  parameter. 

C:  Mode.  Shape  parameter. 

R : An  Independent  identically  distributee  uniform 

random  number  generated  oy  the  vax  11/760  subprogram  'pa-m#. 
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X2 ,X J : 1 h 1 AivG“(u , 1 , (mode-n  in ) / (irax-min) ) . 

X ( I ) 1 = 1,2:  rPiAf«G'’(min,riax,mode) . 


Variaoles  common  to  supproqratrs 


i'.SArP(i)  j=i,2:  Tne  number  of  two-ooservation  samples 

reouirea  tor  distribution  I, 

acsum)  su,w  = l,uSMKk'ti)+r^SAHF(2):  Array  ot  greater  values  ot 

eacn  two-ODser vation  sample  from  the  comfcineo  NSAKP'S. 

btsuM)  6UM  = 1 ,.vSAhp(  l )+r.SAi-PC2) : Array  of  lesser  values  ot 

eacn  two -observation  sample  from  tne  combined  NSA."iP 

Cuur-T:  Counter  for  current  number  of  replications;  simula- 

tion terminates  wnen  cuuM=lkfc.P. 

l 1 : initial  value  required  for  ranuom  number  generation. 


PKUGKAIm  L 1 h I T A J i U iv S 


Subprogram  l H 1 A Mj s Tne  '-one  'C'  usea  to  generate  the 

TKlANci (0,1,0,  where  c= (mode-min ) / (max-min ) , is  restricted 
to  0<C<1. 


SuDrrcgram  GA^a:  aLP»>A>1. 

Suoprogra.Ti  efcTA:  ALPn  A ( 1 , J ) > 1 1 = 1,2. 


input  distributions  must  both  be  of  the  sarnie  type. 

input  distribution  type  is  limited  to  normal,  gamma,  oeta, 
or  trlanquiar, 

Tne  number  of  replications  must  be  less  tnan  or  equal  to  100,000. 
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********** *t**«********MAIN  PKaGRAM******************-****»**«J.*lL*llUlJlJl. 


I 


C0MM0N/ALG/A(50)  ,B(50),SUM 

COKMON/ALL/NFILE,IREP,UEM 

COMMON/OUT/ARESULT(50),BRESULTC50),  PERCENT  (50) 

COMMOhI/FILE/NAME,DISP  

CALL  INITIALlZE(DIST) 

19  GOTO! 1 ,2,3,4) , DIST  

1 CALL  NORMAL 

GOTO  10  . 

2 CALL  GAMMA 

GOTO  10  

3 CALL  BETA 

GOTO  10  . 

4 CALL  TRIANG 

10  CALL  REPORT  . ..  

STOP 

END  


********************************************************************* 

* . I 

* Tnis  subroutine  Inputs  the  type  of  probability  * 

* distribution  and  the  number  of  replication!.  » 

* * 
»♦»»***♦*♦♦♦♦♦»♦♦♦♦♦»»»»»»»**♦♦»*♦*»♦*♦♦*»»»»*»**»»»»***»»♦»»»♦»»»»♦»» 


SUBROUTINE  INITIALIZE(DIST) 

CUM*0N/ALL/NFILE,1R£P,ITEM  .... 

CUMrtOu/FILE/NAME,DISP 

CHARACTER**  LETTER /.CHOICE  

CHARACTLR*1 3 NAME , DISP 

35  wRITECb, 313  

31  format(T2, 'Output  may  be  written  to  a file  and  than  printed  *, 
/'or  printed  at  this  terminal. ' ,/.T2, 'Enter  "F"  for  flit  or  "T"», 
/ ' for  terminal,') 

READ(S,79)CH0ICE  

IFCCHOICE.EO. 'F')THEN 

NFILEsB  . ‘ 

ELSE  IFCCH01CE.EQ. 'T')THEN 

NF1LE=6  

GOTO  3 

ELSE  

GOTO  3b 

END  IF  

wRITE (6, 12) 

12  F0RMATCT2, 'Enter  the  name  of  the  ..output- tilfttAl 

REAO( 5 ,17 ) NAME 

17  t OPMATC  A)  ...  

RRITE (6 , 21 ) 

21  F0Rmat(I2, 'Select  the  disposition  of  the  output  

/ / , T2 , 'Type  t KEEP, PRINT, or  PRINT/DELETE,') 

PEAU(0,27)DISP  

27  FORMAl(A) 

open ( a, FILE*NAME, STATUS* 'NEW ',DISP«DISP)  

3 .vKITt(6,72) 

72  forma i ii2, 'which  type  of  test  Is  to  be  perl orjuedl'*/^!,  

/ 'tnter  "Arun"  If  you  wish  to  Know  the  length  of  the  A ', 

/ 'run  before  the  first  B. ',/, ' Enter  ".BRUN"  11  you  JLi  tH.  th*V  -- 
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13 

10 


23 


92 

93 

91 


/ ' lenotn  ot  tne  h run  before  the  first  h.  ' ,/,l’2, 'Enter  "BOTH* 

/ ' it  vou  would  like  tne  results  of  Doth  tests.') 

&EAD ( 0 , 7 9 ) TES 1 
79  KUNbAT(A) 

If  U£Sl.£Q.'MrfU1.')THfc> 

lTtN.5l 

fc.LSt  IFdKM.EO.  'h«UN')lHfcM 
i T c.  h = 2 

fc.L«St  1FCXEM  .LU.'bUTH')ThKK 
1 T E ’ s 3 
i.  L S t. 

unit.  3 
tno  if 
.*•  (<  t i E ( h , 1 o ) 

f UK  iA'i  (12 , 'select  the  type  of  distribution  you  would  ', 

/ 'like.',/,'  Type  tne  first  letter  of  the  name  to  mane  ', 

/ 'your  input.') 

RtAl>(b,2J)LLTTfc:K 
FOKhAT(A) 

i F ( i> fc. '1  T h K . h! u . 'N')THKu 
0 1 6 T s 1 

ELSE  iF(  LETTKk  . Ki» . '<;')!  riK.i. 
i)  I S 1 = 2 

F L S r.  1 F 1 1,  f T 1 1 . « • t.  u . ' i? ' ) X n K 
UIbl=3 

ELSE  lF(Lr.l  f EN  .Ef, . ' T'Hdt-. 

dlSIS'l 
P bSt 
GOlU  13 
K . 0 I F 
t<  K 1 1 E C n , 9 3 ) 

FOP  !a i(X2, 'Enter  the  number  ot  replications  ', 

'to  be  performed.') 

*Mb(5>,*)  JkKP 
If (1PEP.GT,  1OO000  )(iOTu  92 
P e.  t v n ' 
f \ u 


*+*4***+***********************?********************************** 
* * 

* This  subroutine  aenerates  random  numbers  * 

* from  tne  normal  distribution.  * 

* * 
****************************************************************** 


Su'iKOUj'i.iC  IjUkmAF 

Q 1 E iv M I J '■  V ( 2 ) , Ml  ( ? ) , .Si  <i  *A  (2  ) , hSAMPC  2 ) 

CU*'»C  J/AMj/A(bO)  ,nfbU),SU* 

CU'-  i n -/ALL/fiFlbt;,  IKEP , 1 1 PV. 

C')  *:>0.</F  I LE/-N A<*E,  U1SF 
CdAHACTf.H+13  nAME,[.’1SP 
PEAL  nit 

J ;■<  T ka»  t.R  C dll •» T , 1 i , !i ll  ii 
ciiu.nso 

1F(  FILP .£0.6) GOTO  o 

liPE*'  C «,  FiLE=. "lA-lE.bl  ATybs'ULO'  ,DIoP=DI&F) 

PP1M  5 

F’jp* AT (IX,  'Enter  mu,  sloma  and  number  of  samples', 
' for  tne',/, 12, 

'first  normal  distribution  separated  oy  commas,') 
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• r-'-m 


REAQ(5,*)MU(1) ,SIGMA(1) ,NSAMPCt) 

WRITE(6,17) 

...17  KQRMAICT2, 'Enter  mu,  sigma  ana  number  of  samples', 

/ ' for  the',/,T2, 

. - / 'second  normal  distribution  separated  by  commas.') 

READ(!j,*)MU(2)  ,SIGMA(2) , NSAMPC2) 

WRIIEC6, 19) 

19  F0RRATCT2, 'Enter  a five-digit  number  for  random', 

/.  ' number  generation.') 

REA0(5,*)I1 

. *R1TE(NFILE,90)MUC1),SIGMA(1),MU(2),S1GMA(2),NSAMP(1), 

/ MSAMPC),I1 

98  F0RMATCI2, 'input  distributions  are  N**(  ',F5,2, ',  ',F5. 2, ')  and', 
/ ' N"(',F5,2,','F5.2,').',/,T2,'Number  of  samples  from  eacn  ', 

J ..'distribution  is  ',12,'  and  ', 

/ 12,'  respectively. ' ,/,T2, 'The  random  number  seed  for  this  ', 

/ 'run  is  ',16,'.') 

97  C0UNT=C0UNT+1 

..  . J*0 

SUM  = 0 

1.0  Q J*dtl 

N»0 

...  90  ...  TSQ 

DO  BO  1=1,2 

..  X=RAN(11). 

V ( I )*2 . *X-1 . 

W*.V.CI)*VII  1. 

T=T+W 

.80 CONTINUE 

IFCT.GT.l . )GO  TO  90 
_.  .....  XlSMGtTl 

Y2«-2.*Y1/T 

:i=5QRTCY2)  .... 

N=N  + 1 

_.... 5UMS5UM+1 

X1=MU(J)+S1GMACJ)*V(1)*Y 

_...  X2*MUIJ)+SIGMA( J)  *V  (2)  *Y 

IF(X1,GT,X2)THEN 

A(8UMJ*X1  

B(SUM)sX2 

ELSE  . 

A (SUM)=X2 

. 0 (SUM) =X1 

fcNDIF 

LECN.LT.NSAHPCJ) )GD  TO  90 

IF(J.LT.2)GQ  TO  100 

CALL  SORT 

1F(COUNT.LT.1REP)GOTO  97 

RETURN 

END 

M*  ******  **  ********************************  ***«!»***$  ******  ************ 


* * 

* This  subroutine  generates  random  numbers  * 

* from  the  gamma  distribution.  * 

* * 


*************************  *********  ***************************  ********* 


SUBROUTINE  GAMMA 

DIMENSION  U(2) ,X(2)  ,ALPHA(2) ,BE1A(2),NSAMP(2) 
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h 

10 


8 

1 1 


/ 

/ 


12 


<ib 


/ 

/ 


^7 


t>0 


8 0 
?0 

bu 


.-  v 


Cd"  ’•;,  i/ma./ A ( bO ) , h ( dO ) ,Ptli' 
Ct'i'-U  v/AJ.l./  iHl.fc,  IkrP, 1 it 


C«w».y-./r  lLfc/r*A'-.t,U16P 

C n Aw AC T t H * 1 J t.af'F  , L X ii 
JUT&C.F.M  CuOnT,  11  ,SU/( 

PbAO  LTbKrA,Li,»Z 

cuii 

lr(  'FlLr. .hti.bJOOlli  o 

C- Hi  ILfsf-A'-'b'.bl  Alo.S='ULD',tiiaPs01SPJ 

kklr.T  1 (. 

#•  l»h  - Alt  1A, '►r.ter  alpna,  ceta  a no  number  ot  samples  tor'# 

' tnc',/,12, 

'first  t. a .* m h distribution  separated  ny  commas.') 
kfcAii(b,*)eLHtiA(l)  ,hkTh(  1 J ,..5AKPU) 

IF  (Alwb«U}.lt.l)UlIO  n 
F*l  7 11 

Fur  AllU.'tPter  alpha,  oer.a  am  number  ot  samples  tor', 

' C<ie*,/,‘i2( 

'secono  jamra  nistr lout  ion  separated  ty  commas.') 
PtAd(S,«)ALPbA(2)  , br.l  “(  * ) ,»i>A‘’F(2) 
ll,lrtL8nAl^),Lt.  1)00  lo  n 
Pal. I 12 

fue  A i ( 1 1 , 'i-.nter  a tive-dlqit  bumper  for  random', 

' numoer  veneration.') 
htA j(b,*)ll 

.Hi  f - ( I l IF, ^8 ) A LPH A ( 1 ) ,HtTAll),ALPHAl2)  ,BtTA(2)  ,NSA«HU), 
lift  H 2 ) , i 1 

Fob  A i (12 , 'Input  distributions  are  G“l ' ,Fb. 2, ' , ' ,K5, 2, ')  and', 
' ft' ,fb.2, ', 'F*b.2 ,').',/ ,T2, 'Number  of  samples  from  ', 

'eacf.  -iistriout  Ion  Is  ',12,'  ana  ', 

12,'  respectively T2 . 'Tne  random  numoer  seed  for  this  ', 
'run  is 
lbtlM=4.b 
I 1 *i5.7/lSL-lol4.b) 

1 = 1 ,+Llnc.lA 
F tl  lir<  s OQO  (.  4 , ) 

CUUVJsCuU'.'T+l 
S u ■'  = 0 
F=0 
F = K + l 

Gl  = 2.*ALPr;At*  )-l  . 

G2  = oQr  f t o l ) 

G — 1 . / G 2 
psApPiiAlM+G/ 

H = AuPr  -.At  b ) "Fl'l  b 
r-o 

J=0 

J=J*1 

l U 30  1=1,2 

p = p « 1 1 n 
o C I )=p 

Cl",  I 1 .tip. 

Y l=u(l)/(  1 ,-LU  ) ) 

V'2  = oOG(  Vt  ) 

v=r.u:2 

Y i = h.  x P ( V) 

Y = A L p A t M * Y 1 
Z=0l2)*u(l)*0ll) 


i = F t u * y « v 
! 1. 1>  i - .’.  H * i r*  r r A * t. 


.1  V f i L.;  f . Gr  . w . ) • 
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LNZ*LLGU) 

It U ,G£.LNZ)GO  TO  40 
GU  TG  50 

40  X ( J) ay*BETA(K) 

1F(J.LT.2)G0  70  20 
5UMaSL)H+ 1 
N*N  + 1 

1F(X(1),LT,X(2))THEN 

A(SUM)*X(2) 

8(SUM)eX(l) 

ELSE 

ACSUM)aX(l) 

B(5UK)*XC2) 

END  IF 

1F(N.LT.NSAMP(K))G0  TO  UO 
IFCK.LT.2)GO  TO  60 
CALL  SORT 

IF ( COUNT. LT . IREP) GOTO  97 

RETURN 

END 


*****************************  ******  *J  MM  *1 

* * 

* This  subroutine  generate*  random  number! * 

* from  the  beta  distribution,  * 

***************************************************  *~*  * V*  * *Y*~**  * * * * ****♦" 


SUBROUTINE  BETA  

DIMENSION  X1(2),X2(2) ,U(2) , ALPHA (2,2) ,NSAMP(2) 

CORMON/ALG/A(bO),BC50),5UM  . 

CONfcCN/ALL/NFILE, IREP, ITEM 

C0<>  mon/HLE/NAME,DISP  . 

CHARACTER413  name,disp 

real  ltheta,lnz  

INTEGER  COUNT, 11, N,0,SUM 

COUNTsO  

IF ( NF ILE, EQ,6)G0TU  3 

OPEM6,FILEsNAME,STATU5*'OLD',DI5PsPISP)  

3 PRIM  10 

10  format ux,  'Enter  aiphai,  aipha2  and  number  of  ssmple_*__',  

/ 'for  the',/,T2, 

/ 'first  beta  distribution  separated  by  commas^') 

RE AD (5,*)  ALPHA (1,1), ALPHA (2,1) ,NSAMP(1) 

PRINT  U 

11  FOR RAT (IX, 'Enter  aiphai,  alpha2  and  number  of  samples  ', 

/ 'for  the',/,T2,  ...  ..  

/ 'second  beta  distribution  separated  by  commas,') 

READ(b,*)ALPHA(l,2),ALPHA(2,2),N5AHPC2)  . 

PR  I ;T  12 

12  FURf at( T2, 'Enter  a five-digit  number  for  random',... 

/ ' number  generation,') 

READ ( b , * ) 1 1 

WRITE (NFILE, 9b) ALPHA( 1 , 1 ) , ALPHA ( 2 , 1 ) , ALPHA! 1 ,2) , ALPHA (2,2) , 

/ FSAMF(l),tvSAMP(2),Il  . 

fuRiiAi(T2,  'Input  distributions  are  B“(',F5,2,',',F5,2,')  and', 

/ ' B*’(',F5.2,','F5,2,'),',/,T2,'Number  of  samples  from', 

/ ' eacn  dlstrlDutlon  is  ',12,'  and  ', 

/ 12,'  respectively. *./,T2,  'The  random  number  seed  for  this  ', 
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'run  is  ',1b,'.') 

TilK.  I'As'i.b 
LJ  Hfcl  ASUKH4.5) 
f =l  .41.1HMA 
(■J  UR*  LOG  (4.) 

C‘iUr'isCuu^T+1 

CsO 

55  U - 0 

Do  yo  i-.sl,2 

CfSlHl 

l,Q  if  0 .v  = l ,I.,SAMP(I.) 

SUMsSI/i«.  + 1 

C'J  70  t=J,2 

IV'  h'J  K = 1 ,2 

01  = 2 • * ALPnACK , N ) " 1 . 

G2-but(  l ( G 1 ) 

G = 1 • / G 2 

UsAl.FnAtl-  , <0+G2 
HsALF  r;A  ( K , A ) “FOUh 
l»0  JC  i = 1,2 
RchAMll) 
lli)sp 
CbMI.iUL 

Vl«u(l)/(1,-U(l)) 

V2=bQGC vl ) 

v=r,»v2 
n*tXf c v > 

YsALPHA(K,t.)*Yl 

Zsi'C2)*U(l)*U(l) 

*<  S H ♦ G * V " Y 
TLS7  s,-,  + it-lhf.  TA*Z 
IK(IL'M.Gfc.O.)GU  II  40 
L !» Z = LG  G ( Z ) 

)F  U.GF .L  ,Z)GO  TU  40 
GO  TO  50 
XI  (*.)«¥ 

Cbf<J]NUt 

X2(L)=X1(1)/(X1(1)+X1(2) ) 

CU'J  J lixUt. 

IF  ( a2(1  ) ,c;T.X2(2)  }THt  » 

A C Si'** ) =X2 ( 1 ) 
hlSO  l)=X2(2) 

F ISt. 

A ( 5 u ;•'• ) s A 2 ( 2 ) 

FUSU")=X2(1 ) 

F . GIF 
C'Jl'  I 1 .>•  0 L 
CO.  * 1 Li-. 

CALL  60hT 

IF  (Cf'oM.LT.J l<kJP)GGTU  97 
FtTGb?: 

L;.G 


***♦*»*»*»*♦*:»*♦****************♦***********♦**♦****♦***********♦**** 
* * 

* inis  subroutine  generates  ranaom  numbers  * 

* fro.n  tne  triar.oular  distribution.  * 

* * 

9* **«****«***********+******»»* ************************************** 


SUBROUTINE  TH1ANG 

LilM£liSJ.Qk.Jl(2X4aC2J^iU2),C(20#iii»AMPC2).  ..  

COMMON/ ALG/A (50) ,BC50),SUM 
COM MON/ ALL/N FILE » 1 REP, ITEM 
COMMON/FILE/NAME,DISP 
CHARACTERM3  NAME , DI5P 
INTEGER  COUNT, II, SUM 

CAimiffi. 

IF(NFILE,E0.6)G0T0  9 

OPEN  C&,FILE*NAME,  STATUS*  'OLD',  D1SP«DISP) 

9 PRINT  50 

50  FOFMATUX, '.Enter  minimum,  maximum,  mode  and  number  o £ 

/ 'of  samples  from  tne',/,T2, 

Z..__'±irst  triangular  distribution  separated  by  commas.*} 

READ(5,*)G(1),H(1),C(1),NSAMP(1) 

13  PRINT  51 

51  FORMATUX,  'Enter  minimum,  maximum,  mode  and  number  of  ', 

t *_of  samples  from  the',/, 12, 

/ 'second  triangular  distribution  separated  by  commas.*) 

R£AM5,!)Gt2),HI2},CC2J.,USAMPC2)  - 

IF(G(l),GT.H(l).OR.G(l).GT,C(l).QK,C(l),GT,H(l))THEN 
MR1TE(6,7) 

7 FOKMATCT2, 'INCORRECT  PARAMETERS  ON  DISTRIBUTION,', 

..  TRY  AGAIN.') 

GOTO  9 

ELSE  1F(GC2).GT.H(2).QR.G(2).GT.C(2).0R.C(2).GT.H(2))THEN 

WRITE (6, 7) 

GOTO  13 

ELSE 

PRINT  52 

52  FORMATCT2, 'Enter  a five-digit  number  for  random  number', 

L '..generation.')  ...  

READ15, *) II 
END  IF 

WRITECNFILE,98)G(1),H(1) ,CC1)  ,G(2)  ,H(2),C(2),NSAMP(1), 

. / NS AMP (2) ,11 

98  FURMAT(T2 , 'Input  distributions  are', 

1!X'*.F5.2, ' , *,F5.2, ', *,F5.2, ')  And',  

/ ' T-(',F5.2,',',F5.2,',',F5.2,').', 

/ /,T2, 'Number  of  samples  from  each  distribution  is 

/ 12,'  and  ',12,'  respectively.', 

/,T2,'Tne  random  number  seed  for  this  run  is  ',16,'.') 

97  COUNT=CUUNT+i 

JUMsfi.  . 

DO  30  1=1,2 
T«HII)-GCI) 

COMP=(C(I)-GU))/T 
COMP1=1,-COMP 
DO  20  Lsl,NSAMP(I) 

SUM=5UMtl  . 

DO  10  J=1 , 2 
UsRANtli) 

IF ( U . LE .COMP) THEN 
XlsCOMPWU 
X2*S0RT(X1) 

......  X (J)=G( I)+T*X2 

ELSE 

Xl=COMPl*(l,-U) 

X2=SORT (XI ) 

X3*l «-X2  

102 


xcj)s(i(i)  + r*x3 

EwDlF 

10  CQNTIrtiit 

IF(AC1).GT.X(2))THEN 

A(Su<-*)=XCl) 

B(SLf')=X(2) 

ELSE 

A (SU *')=A(2) 

B(SUiOsXCl) 

EnDIF 

20  CON  f I:\Ufe. 

30  CONTINUE 
(.'ALL  SORT 

I F ( COUN  I , liT , IKEP)GUTO  97 

RETURN 

END 


tfV********************************************************************* 
♦ ♦ 

* inis  subroutine  performs  a Duoble  sort.  * 

* * 
tfyftf****************************************************************** 


SUBROUTINE  sukt 

or  V ELS  ION  AA(2),BB(2) 

CU/t  iCN/ALG/ A ( 50) , B ( 50 ) , SUM 
CQ^'O'i  / A LL/uF  1 LE , 1REP , ITEM 

CO«  v.c;, /OUT/ A RESULT  (50)  ,B?ESULf (50)  , PtKCtM(bO) 
CL»f"FO.v /FILE/ NAME,  LISP 
CHAKACTERM3  l'<AME,i)I8R 
[ NT EGER  m R U L , B R U f , S U M 
12  IVItHAsi 

DO  I = l,(6UM-n 
IF  l A ( I ) « GE , A ( 1 + 1 ) )GU TO  ?b 
AA(2)sA(I) 

AA(1)=A(I+1) 

A(1)CAA(1) 

Atl+1 )=AA(2) 

InTERAsO 
25  CONI  I N U E 
EuD  DU 

IF ( INTER A , Ey . 0 ) GOTO  12 
9 JNTERtt=l 

DO  J = 1 , (SiM-i ) 
l F ( B ( J ) , GE . ri ( 0+ 1 ) ) GOTO  27 
HR (2) (01 
bti(  l )=B(  J + l) 

»(J)=BB(i) 

«(J+1 )sBb(2) 

I r«  T E R ti  s i.) 

27  CONTINUE 

f-ND  DO 

IFC INTER h.EO.OJGOTU  9 
ARUusO 
RHU.sO 
DO  1=1, SUM 
I F ( A ( i ) , G1  . T*(  1 ) )Ti*t« 

ftKlJuS  ARUN  + 1 
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ELSE 
GOTO  7 
END  IF 
END  DO 

7 CONTINUE 

DU  l = StJM,l,-l 
IF!B!I).LT,A!SUM))THEN 
BRUNabKUN+l 
ELSE 
GOTO  18 
END  IF 
END  DO 

18  CONTINUE 


**************************** ********** 

* End  of  replication  bookkeeping!  * 

* This  part  of  the  subroutine  stores  * 

* the  statistic  "I  of  runs  of  length  * 

* X"  in  the  array  RESULTCX)  *_ 

************************************** 


DO  1*1, SUM 

IF(AKUN,EQ,I)ARESULTII)aARE5ULT(I)tl 
IF(BRUN.EQ.I)BRESULT(I)*BRESULT!I)+1 

END  DO 
RETURN 

END  

♦*♦»»♦♦♦♦♦♦»»*♦♦♦*»♦*»»*♦»*»»♦♦*»***»**»»*»♦*»*»**♦*»*»*»****»***»*****»< 
* * 

♦ Tnis  subroutine  generates  the  _oiitJJiu.t_jLf * 

♦ the  simulation,  * 

♦ ....... * 

************************************************************************* 


SUBROUTINE  REPORT  . 

Dimension  AA(2) , BB( 2), CUM (50), INI (50) 

CO*MON/ALG/A!50),B150J,SUM  --  

CQMWON/ALL/NFILE,IREP,ITEM 

CQMmQN/OUT/ARESULT  (503  , BRESULT!  501,  PERCENT  CSGD 

COMMON /FILE/NAME, DISP 

CHARACTER*1 3 NAME , DISP , ANSWER  . 

INTEGER  SUM 

IF!NFILE.EQ.6)GOTQ  11  . 

OPEN (8, FI LE=NAME, STATUS* 'OLD', DISP«DISP) 

11  IFdTEM.EO,  1 ,OR. ITEM, EQ. 3) THEN  . . 

ASSIGN  37  TO  IOUT 

ELSE  . 

ASSIGN  39  TO  IOUT 

END  IF  

*RITE(NFILE,IOUT) 

37  FORMAic///, T2, 'Test  for  the  Length  of  the  Initial  Rim  .01% 

/ ' A ' ' s before  the  first  B,',/) 

39  FORMAT!///, 12, 'Test  for  the  Length  of  the  Initial  Juo_oi  % 

/ '»"s  before  the  first  A,',/) 

«RIIE!NFILE,13)IREP  ....  

13  format ! 12 , 'Number  of  Replications!  ',16,/J 

IF (ITEM, EQ ,1,0k,  1TEM.EO,  3)THEN  

rtRHE(NFILE,42) 

42  FORMATIT2, 'Length  4% 'Observed  %4X, 
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/ ’'Relative  ',4X,  'Cumulative') 
wRlTE(NFILE,44) 

44  FORMAT (T2 » 'of  Run * ', 4X , 'Frequency * ', 4X, 

/ 'Frequency*', 4X, 'Frequency*') 

T*IREP 
CUM ( 0 ) *0 , 0 
DO  1*1 , SUM 

PtRCENT(I)*ARE5ULT(I)/T 

CUM(I)=CUMCI-1)+PERCENT(I) 

END  DO 
DO  1*1, SUM 
INT(I)*ARESULT(I) 

WRITECNFIIF,17)I,INI(I),PERCENT<I),CUMCI) 
17  FORMAT(T2,IJ,T12,I6,T28,F8.6,T42,F8,6) 

END  DO 

wRIT£(NFILE,62) 

62  FORMAKT2, /,/,/,/) 

CALL  GRAPH 
IF ( ITEM , EG.  1 ) GOTO  57 
ITEM*2 
GOTO  11 
ELSE 

WRITE ( NFILE, 42 ) 

WRITE ( NFILE, 44) 

T=IREP 

CUM(0)=0.0 

DQ  1*1, SUM. 

PERCENT(I)=BRE’SULT(I)/T 
CUM(I)*CUM(I“1) t PERCENT ( I ) 

END  DO 
DO  1*1, SUM 
InTCI)=BKESULT(I) 

\ WRI1E(NFILE,17)I,INT(I),PLRCENT(I),CUM(I) 

END  DO 

WRIT£(NFILE,62) 

CALL  GRAPH 
END  IF 

57  CONTINUE 

. RETURN 

END 


*******  ******  *********** ***************************************** ****** 


4 • 

* This  subroutine  generates  the  relative  and  * 

* cumulative  frequency  distribution  * 

* function  graphs,  * 

4 4 


*********************************************************************** 


SUBROUTINE  GRAPH 
DIMENSION  TEST1 (50) ,CUM(50) 

COMMON /ALG/A{ 50) , BC  50 ) ,5UM 
COM  MOi» / ALL/ N FILE, IREP,  ITEM 

COMMON/OUT/ARESULT(50) ,HRESULT( 50) , PERCENT (50) 

COMMON/FILE/NAME, DISP 

CHARACTER*13  NAME, DISP , ANSWER 

INTEGER  SUM 

lF(NFILE.EO,b)GOTO  13 

0PEN(8,FlLEsNAME,STATUS=_'0LD',DISP*DISP) 
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""  '•'4*** 


13  IK  ( ITEM. . EU  . 2 ) THEN 

ASSIGN  99  10  L 

ASSIGN  88  TO  K . 

ASSIGN  84  TU  M 

ELSE 

ASSIGN  98  TO  L 

ASSIGN  86  10  K ...  .. 

ASSIGN  82  TO  M 

END  IK  ..._  

*RITE(6,L) 

98  format ( T2 , 'For  this  test  of  the  length  of  the  run  of ',  _ 

/ ' A"s  before  the  first  B,',/,T2, 

/ 'do  you  wish  a oar  graph  of  the  relative  and'# 

/ ' cumulative  frequency  distributions?'  ,/»T2# 

/ 'Enter  "US"  or  "NO" , ' ) . 

99  FURMAKT2, 'For  this  test  of  the  length  of  the  run  of', 

/ ' b " s before  the  first  A,',/,T2,  ...... 

/ 'do  you  wish  a bar  graph  of  the  relative  and', 

/ ' cumulative  frequency  distributions?', /,T2,  

/ 'Enter  "YES"  or  "NO",') 

RtAD(b,91)ANS>VER  . 

91  FOR. v A 1(A) 

IKCANSwt.H,EO,'NO')GOTO  89  ........ 

*hl it (NFILE.27) 

2/  FURN A l ('1')  

WRItfc INFILL, K) 

86  FOR' ai (15,  'Relative  frequency  graph',  

/ ' for  the  Length  of  the  Initial  Run  of', 

/ ' A"i  before  tne  first  B.',/)  . 

88  format (Tb,  'Relative  frequency  graph', 

/ ' for  the  Length  of  the  Initial  Run  of',  ... 

/ ' B ' ' 5 before  tne  first  A.',/) 

wk1IE(nFILE,37)  

37  RuR.vATCTb,  'U  . 00  ' , T10 , ' . 04  ' , T1 4 , ' . 08  ' , T1  B , ' , 1 2 ' ,T22 , ' , 1 6 ' , 

/T26,  ',2u',T30,  '.24',T34,'.28',T36,'.32',T42,'*3&',1464.'afl', 
/TbO,  '.44',Tb4,'.48',Tb8,'.52',T62,'.56',T66,'.60',T70,',64', 
/T7  4,',68',T78,'.72',T82,'.76',T86,  ',80',T90,'.M',T54,'»i.8', 
/T98, '.92',T102,'.96',T106, '1,00') 

31  WRITE ( N FILE, 33 ) 

33  FORMAT (T6 , 

/ ' + - + - + - + - + + + + - + “ + - + - + + 

/ 

CURIO) =0,0 
DU  1=1, SUN 

TESTl(l)  = (PEKCEM(m.QOb)*lQQQ.  ..  

J = T E S T 1 ( I ) / 1 0 . 

1FCJ.EU.0)IHEN 
ASSIGN  4 3 Tu  I DOT 

ELSE  . ... 

ASSIGt,  41  TO  100T 

END  If  

CUM(1)=CUM(1-1 )+FE8CENT(I) 

*RI TE(.»E1LE,  IOUT)I,PERCENT(I) 

41  FOkFATC  '0',T3, 12,T6,  '-',<JX1H+)  ,2X,F8,3) 

43  FOhrATC'0',T3,12,T6,'-',2X,F8,3) 

END  DU 

wP.lIf  (NF1LE.27) 
a RITE  I'-' FILE, Ml 

82  FURma  ni'b, 'Cumulative  frequency  graph', 

/ ' tor  tne  Length  of  the  Initial  Run  of', 

/ ' A "s  before  the  f liLSi__», ' ,/ ) 
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84  FORMAT (15, 'Cumulative  frequency  graph', 

/ ' for  the  Length  of  tne  Initial  Run  of', 

/ ' B"s  oefore  the  first  A.',/) 

WRITE ( NFILE, 37 ) 

WRITE (h FILE, 33) 

00  lei, SUM 

TEST1(I)=(CUM(I)+.0Q5)*1U00. 

JaTESTl (I)/10, 

IF ( J . EQ . 0 ) THEN 
ASSIGN  43  TO  IOUT 
ELSE 

ASSIGN  41  TO  IOUT 
END  IF 

WRITE(NFILE,IQUT)1,CUM(I) 

END  DC 

WRITE (NFILE, 27) 

89  CONTINUE 
RETURN 
END 


*********************************************************************** 


107 


* 





I 


APPENDIX  IV 


ACKNOWLEDGEMENTS 


Thanks  are  due  to  Ms.  Kimberly  S.  Druschel,  Mr.  Ronald  L.  Fischer,  and  others 
at  Fort  Lee,  VA,  for  support  and  helpful  conversations. 


APPENDIX  V 
REFERENCES 


1.  Downs  and  Cox,  "The  Probability  of  Motor  Case  Rupture,"  presented  at  the 
20th  Conference  on  Design  of  Experiments  in  Army  R,D&T,  October  1974. 

2.  Knaub,  "Design  of  a Multiple  Sample  Westenberg  Type  Test  for  Small  Sample 
Sizes,"  presented  at  the  27th  Conference  on  Design  of  Experiments  In  Army 
R.D&T,  October  1981. 

3.  Law  and  Kelton,  Simulation  Modeling  and  Analysis,  McGraw-Hill  Inc.,  1982. 


ADDENDUM 

After  listening  to  the  presentation  of  this  paper,  Dr.  W.  J.  Conover  of  Texas 
Tech,  commented  that  perhaps  a rank  test  based  on  the  rank  sum  of  the  A's  or 
some  other  appropriate  measure  might  be  used  as  a more  powerful  test  of  overall 
compatibility  among  the  samples  with  emphasis  on  shift  of  location.  In  view  of 
the  weakness  of  the  modified  Westenberg  test  for  the  median  which  is  given 
here,  and  the  generally  high  power  of  rank  tests,  this  suggestion  seems 
promising.  I would  still,  however,  suggest  the  LI R test  for  the  purpose  for 
which  it  Is  intended:  It  emphasizes  the  improvement  which  can  be  obtained  by 

exclusion  of  a particular  sample.  However,  beware  of  repeated  deletions.  The 
probability  levels  of  the  test  change  step-wise  with  each  application. 


